Skip to content

build: Compile dataset helper#2412

Merged
ko3n1g merged 1 commit intomainfrom
ko3n1g/build/compile-helper
Feb 17, 2026
Merged

build: Compile dataset helper#2412
ko3n1g merged 1 commit intomainfrom
ko3n1g/build/compile-helper

Conversation

@ko3n1g
Copy link
Copy Markdown
Contributor

@ko3n1g ko3n1g commented Feb 17, 2026

What does this PR do ?

Without this PR, the compiled file helpers_cpp.cpython-312-x86_64-linux-gnu.so is missing

Changelog

  • Add specific line by line info of high level changes in this PR.

GitHub Actions CI

See the CI sectionin the Contributing doc for how to trigger the CI. A Nvidia developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items you can still open "Draft" PR.

Additional Information

  • Related to # (issue)

Summary by CodeRabbit

Release Notes

  • Chores
    • Updated build infrastructure to improve package dependency handling during container setup.

@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot bot commented Feb 17, 2026

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@ko3n1g ko3n1g marked this pull request as ready for review February 17, 2026 18:47
@ko3n1g ko3n1g requested a review from a team as a code owner February 17, 2026 18:47
@ko3n1g
Copy link
Copy Markdown
Contributor Author

ko3n1g commented Feb 17, 2026

/ok to test eeac91e

@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai bot commented Feb 17, 2026

📝 Walkthrough

Walkthrough

Adds a RUN command to the Dockerfile that installs the Megatron-LM package from the 3rdparty directory in editable mode without dependencies during the Docker image build process.

Changes

Cohort / File(s) Summary
Docker Build Configuration
docker/Dockerfile.ci
Adds a pip install -e command to install Megatron-LM package from /opt/Megatron-Bridge/3rdparty/Megatron-LM in editable mode with --no-deps flag during image build.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~8 minutes

Possibly related PRs

  • cp: feat: Add dataset compile helper (#2236) #2249: Related to how the local 3rdparty/Megatron-LM package is installed and integrated, with this PR adding Dockerfile pip install-e while the other enables editable source in pyproject and runtime compilation.

Suggested reviewers

  • chtruong814
  • thomasdhc
  • maanug-nv
🚥 Pre-merge checks | ✅ 4
✅ Passed checks (4 passed)
Check name Status Explanation
Title check ✅ Passed The title 'build: Compile dataset helper' is specific and clearly related to the main change: adding a build step to install/compile the Megatron-LM dataset helper in the Docker CI image.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Test Results For Major Changes ✅ Passed Minor 4-line Docker build configuration change that does not affect model numerics, convergence, performance, or introduce significant refactoring. Test results not required for minor build infrastructure changes.
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing touches
  • 📝 Generate docstrings
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment
  • Commit unit tests in branch ko3n1g/build/compile-helper

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Copy Markdown
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@docker/Dockerfile.ci`:
- Around line 58-59: The RUN line in the Dockerfile is passing arguments in the
wrong order so -e treats --no-deps as its path; update the pip install
invocation in the RUN that targets /opt/Megatron-Bridge/3rdparty/Megatron-LM so
that --no-deps appears before the editable flag (e.g., use "--no-deps -e .") or
otherwise ensure -e's argument is the project path, keeping the same directory
and installation intent.

@ko3n1g ko3n1g added the r0.3.0 Cherry-pick label for r0.3.0 release branch label Feb 17, 2026
Signed-off-by: oliver könig <okoenig@nvidia.com>
@ko3n1g ko3n1g force-pushed the ko3n1g/build/compile-helper branch from 7228528 to 28d7381 Compare February 17, 2026 19:57
pengdurice pushed a commit to pengdurice/Megatron-Bridge that referenced this pull request Feb 24, 2026
Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: pengdurice <pengduhit@gmail.com>
copy-pr-bot bot pushed a commit that referenced this pull request Mar 19, 2026
Signed-off-by: oliver könig <okoenig@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

r0.3.0 Cherry-pick label for r0.3.0 release branch

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants